Using neural networks to locate pitch accents
Abstract
This paper describes a technique for finding intonational events (pitch accents and boundary tones) from waveforms. The technique works in a bottom-up manner by using a recurrent neural network to classify each frame in the input waveform. An autosegmental description, consisting of intonational events, syllables and the links between them, is then produced from this frame-based classification. The technique correctly identifies 85.7% of pitch accents and boundary tones.
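The step from a frame-based classification to discrete intonational events can be illustrated with a minimal sketch. The abstract does not say how the paper performs this step, so everything here is an assumption made for illustration: the label names ("accent", "boundary", "none"), the `min_frames` threshold, and the simple run-merging rule are hypothetical. Linking events to syllables is not covered.

```python
from itertools import groupby


def frames_to_events(frame_labels, background="none", min_frames=3):
    """Group per-frame labels into (label, start_frame, end_frame) events.

    Assumed post-processing, not the paper's method: consecutive frames
    with the same non-background label are merged into one event, and
    runs shorter than `min_frames` are discarded as spurious.
    `end_frame` is exclusive.
    """
    events = []
    index = 0
    for label, run in groupby(frame_labels):
        length = len(list(run))
        if label != background and length >= min_frames:
            events.append((label, index, index + length))
        index += length
    return events


# Hypothetical classifier output, one label per frame.
labels = (["none"] * 5 + ["accent"] * 4 + ["none"] * 2
          + ["accent"] * 1 + ["none"] * 3 + ["boundary"] * 3)
events = frames_to_events(labels)
```

Frame indices can be converted to times by multiplying by the frame shift used by the front end (for example 10 ms, which is also an assumption here).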
Similar resources
Using Neural Networks and Genetic Algorithms for Modelling and Multi-objective Optimal Heat Exchange through a Tube Bank
In this study, a multi-objective optimization technique was used to predict the optimal design points of forced convective heat transfer in tubular arrangements, based on the size, pitch and geometric configuration of a tube bank. The main concern of the study is calculating the most favorable geometric characteristics, which may lead to a maximum heat exchange as well as a m...
An investigation of acoustic events related to sentential stress and pitch accents, in English
An algorithm is described to abstract acoustic parameters of a speech waveform to give a scalar measure of the relative stress and pitch movement of each group of phones which can consist of a single prominence. A method of identifying such groups using acoustic information is given. The abstracted parameters are used to locate sentential stress and pitch accents in English speech. These are compa...
Integration of Color Features and Artificial Neural Networks for In-field Recognition of Saffron Flower
Manual harvesting of saffron is a laborious and exhausting job; it not only raises production costs, but also reduces quality due to contamination. Saffron quality could be enhanced if automated harvesting were substituted. As the main step towards designing a saffron harvester robot, an appropriate algorithm was developed in this study based on image processing techniques to recogn...
An Analysis of the Pitch Contour of an English Declarative Question Read Aloud by Chinese EFL Learners
The pitch contour of an English declarative question read aloud by 12 Chinese EFL learners is labeled and analyzed by means of the phonetic software Praat on the basis of the English intonation grammar proposed by Pierrehumbert. The careful study yields two findings. 1) Most Chinese EFL learners can correctly locate the stresses of the target sentence and attach legal tonal events to the p...
Prosodic Event Recognition Using Convolutional Neural Networks with Context Information
This paper demonstrates the potential of convolutional neural networks (CNN) for detecting and classifying prosodic events on words, specifically pitch accents and phrase boundary tones, from frame-based acoustic features. Typical approaches use not only feature representations of the word in question but also its surrounding context. We show that adding position features indicating the current...